An Automatic Annotation Technique for Web Search Results

نویسندگان

  • Wei Liu
  • Xiaofeng Meng
  • Weiyi Meng
  • Sridhar Reddy
  • Raja Jacob
  • Arvind Arasu
  • Hector Garcia-Molina
  • Yanhong Zhai
  • Bing Liu
  • Praveen Sundar
  • Hongjun Lu
  • Jungwha Hong
چکیده

The uses of web search engines are very frequent and common worldwide over the internet by end users for different purposes. A web search engine takes the query request from the end user and executes that query on relational database used to store the information on behalf of that web search engine. Based on input queries the dynamic response is generated by search engine, in the form of HTML based pages. Such pages are supported with the web databases. Every web page generated contains many results to display for particular query, called as Search Result Records (SRRs). Sometimes it becomes troublesome to extract relevant data from diverse sources. The SRRs generated may contain data units that are relevant to one common semantic. These SRRs are further required to be assigned with proper labels. The manual methods for record extraction and labeling have a worse scalability. Thus automatic annotation based method is needed to improve the accuracy as well as scalability of web search engines. This paper presents an automatic annotation technique for web search results. The proposed approach first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, each group is annotated from different aspects and aggregates the different annotations to predict a final annotation label for it. The annotation wrapper generated for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Experiments indicate that

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

Review on Automatic Annotation of Query Results from Deep Web Database

In recent years, web database extraction and annotation has received much attention from the database and Information Extraction(IE) in research area due to the volume and quality of deep web. Many web databases are accessible through HTML formbased interface. When query is submitted to the search interface the query result page is generated. Search Result Records(SRRs) are the result pages obt...

متن کامل

Probability Association Approach in Automatic Image Annotation

introduction Content-based image retrieval (CBIR) has wide applications in public life. Either from a static image database or from the Web, one can search for a specific image, generally browse to make an interactive choice, and search for a picture to go with a broad story or to illustrate a document. Although CBIR has been well studied, it is still a challenging problem to search for images ...

متن کامل

Web Image Search by Automatic Image Annotation and Translation

There has been a growing interest in implementing online Web image search engine in the semantic level. However, most existing practical systems including popular commercial Web image search engines like Google and Yahoo! are either textbased or a simple hybrid of texts and visual features. This paper proposes a novel Web Image Search by Automatic Image Annotation and Translation (WISAIAT) syst...

متن کامل

Annotation-Based Automatic Action Processing

With a strong motivational background in search engine optimization the amount of structured data on the web is growing rapidly. The main search engine providers are promising great increase in visibility through annotation of the web page’s content with the vocabulary of schema.org and thus providing it as structured data. But besides the usage by search engines the data can be used in various...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016